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FAST AND SCALABLE APPROXIMATION METHODS FOR FINDING 
MINIMUM COST FLOWS WITH SHARED RECOVERY STRATEGIES, 

AND SYSTEM USING SAME 

Cross Reference to Related Applications 

This application claims the benefit of United States Provisional 
Application Number 60/262,410, filed January 18, 2001. 

Field of the Invention 

The present invention relates generally to network routing, and more 
particularly, to fast and scalable approximation methods for finding minimum cost 
flows with shared recovery strategies and systems using these methods. 

Background of the Invention 

In a packet switching network, a packet of information is routed from a 
source to a destination by using a destination address contained in the packet. Packets 
of information are multiplexed onto network pathways. No particular path is reserved 
for data in a packet switching network, and a packet can generally take any path in the 
network. While a packet is not limited to any particular path, techniques exist for 
routing a packet based on certain criteria, such as speed, cost, distance, and Quality of 
Service (QoS). 

An additional type of network that is becoming common is a label 
switching network. A label switching network runs in conjunction with a packet 
switching network, and uses labels placed into packets in order to route packets. 
Normal routing procedures for packet switching are disabled. A "tunnel" is created to 
connect a source with a destination. The appropriate labels are selected for packets 
emanating from the source and entering the network so that the packets follow the 
tunnel and end up at the destination. Label switching offers the benefit of providing 
tunnels for packets. Consequently, packets should arrive at the destination at more 
regular intervals, which is important in certain time-sensitive applications such as 
real-time audio and video. 
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Packet switching and label switching networks both are helped by 
methods that route demands on these networks. These methods can be used real-time, 
meaning that changes are made to the network in order to more efficiently route 
demands on the network, or can be used to determine when or how the network 
should be upgraded to support new demands. Problems with methods used to solve 
network routing are that they are complex and time-consuming. A need therefore 
exists for techniques that solve these problems. 

Summary of the Invention 

Broadly, techniques for solving network routing within a 
predetermined error are disclosed. These techniques may be applied to networks 
supporting dedicated reserve capacity, where reserved capacity on links in the 
network is dedicated for a particular commodity (generally, a source and sink pair of 
computers), and shared recovery, where reserved capacity on links is shared amongst 
two or more commodities. These techniques use an iterative process to determine 
flows on each of the links in a network. As the flows for each link are modified, the 
costs for each link are also modified. The process continues to modify flows on links 
and costs for links until a network routing solution is determined that is within a 
predetermined amount of an optimal network routing. The network routing generally 
includes, for each link, a primary amount of flow that will normally flow on the link 
and a secondary flow that will be placed on the link should a failure occur in certain 
locations in the network. Once the network routing is determined, a process can 
configure the network in accordance with this routing, or a network administrator can 
use the network routing to determine when and how to upgrade the network. 

In a first aspect of the invention, a technique is disclosed that performs 
the following steps. Costs are set for each commodity, and primary and secondary 
(i.e., backup) flows are initialized. A commodity is selected and demand for the 
commodity is routed through the shortest path. There are well known techniques for 
determining the shortest path that minimizes cost according to weights associated 
with the links. Due to certain assumptions in the present invention on the feasible 
paths for flows, shortest path computations are relatively easy. One extra 
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complication, however, is that there are different edge costs associated to each failure 
mode. A failure mode comprises a given set of network elements. Typically, failure 
modes consisting of one link are generally modeled, but it may also be useful to 
protect against node failures or multiple failures. Network routing is achieved 
incrementally in rounds. Each round comprises sending flow for each commodity 
down its shortest paths (or pair of paths). These flows in turn affect the costs on the 
links, which in turn affects future choices of the shortest paths. After each round, a 
function is evaluated to check whether it is above a certain threshold. If so, the 
computation of an approximate solution has been completed. 

In a second aspect of the invention, the technique of the first aspect is 
performed subject to a budget constraint. In a third aspect of the invention, capacity 
is reserved when determining appropriate network routing. In fourth and fifth aspects 
of the invention, the techniques of the present invention are used with Multi-Protocol 
Label Switching (MPLS) and Multi-Protocol Wavelength Switching (MP1S), 
respectively. 

A more complete understanding of the present invention, as well as 
further features and advantages of the present invention, will be obtained by reference 
to the following detailed description and drawings. 

Brief Description of the Drawings 

FIG. 1 illustrates a block diagram of a system that used to explain 
network routing and recovery; 

FIG. 2 illustrates a block diagram of an exemplary system used to 
illustrate aspects of the present invention; 

FIG 3 is a block diagram of an exemplary network controller in 
accordance with a preferred embodiment of the invention; and 

FIG. 4 is a flow chart of a method of determining network routing 
without a budget constraint, in accordance with a preferred embodiment of the 
invention. 
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Detailed Description of Preferred Embodiments 

Broadly, techniques are presented for determining network routing. 
These techniques support dedicated reserve capacity, where reserved capacity on links 
in the network is dedicated for a particular commodity (generally, a source and sink 
pair of computers), and shared recovery, where reserved capacity on links is shared 
amongst two or more commodities. Basically, the techniques are given a set of 
demands as inputs. Each demand is a requested amount of capacity to be carried on 
the network. Demands are created between a source device and a destination device. 
The network lies betweens the source and destination devices and must be able to 
carry the demand, subject to certain criteria. One such criterion is generally some 
amount of extra demand so that communication between the source and destination 
can continue if a link failure or other failure occurs over the primary path chosen for 
the link. This extra demand will be routed over a secondary path. The outputs of the 
techniques include the determined primary and secondary demand for each link in the 
network. 

The techniques of the present invention may be used during real-time, 
which means that the primary and secondary demands determined as a network 
routing solution may be used to configure the network to actually route the 
determined demands. This is particularly true because the iterative methods of the 
present invention allow fairly fast convergence to a network routing solution, even for 
very large networks. Alternatively, the determined demands may be used to 
determine how or when to upgrade the network. For example, a network 
administrator may know that the current demand but may desire to know what the 
desired capacity or cost will be if the demand increases by a certain percentage. If the 
network administrator has data showing the approximate bandwidth increase per year, 
for example, for the network, the administrator can then determine when the capacity 
of the network will need to be increased based on the data. 

Before proceeding with additional detail about the aspects of the 
present invention, it is helpful at this point to describe an exemplary system and 
possible routing problems in the system. Both packet and label switching networks 
have to route packets from a source to a destination. FIG. 1 helps to illustrate routing 
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determination and possible problems. A system 100 is shown in FIG. 1. System 100 
comprises two source devices 110, 120, two destination devices 160, 170, a network 
170, and a network controller 180. Network 170 comprises a number of nodes 130-1, 
130-2, 130-3, 140-1, 140-2, 140-3, 150-1, 150-2, 150-3, and 150-4. In this example, 
packets originate at source device 110, travel through link 115, are placed onto the 
network 170 by node 130-1, are removed from the network 170 by node 130-3, travel 
through link 165, and are received by destination device 160. Similarly, packets 
originate at source device 120, travel through link 125, are placed onto the network 
170 by node 130-1, are removed from the network 170 by node 130-3, travel through 
link 175, and are received by destination device 170. For simplicity, paths taken by 
packets will be treated as one-way paths. 

In system 100, there are three possible paths from which node 130-1 
may select: (1) a path connecting nodes 140-1, 140-2, and 140-3 (collectively, "nodes 
140"), having links 145-1, 145-2, 145-3, and 145-4 (collectively, "links 145"); (2) a 
path connecting nodes 130-1, 130-2, and 130-3 (collectively, "nodes 130"), having 
links 135-1 and 135-2 (collectively, "links 135"); and (3) a path connecting nodes 
150-1, 150-2, 150-3, and 150-4 (collectively, "nodes 150"), having links 155-1, 155- 
2, 155-3, 155-4, and 155-5 (collectively, "links 155"). Each link has a certain 
capacity. Each node is generally a router or a device that performs the functions of a 
router. 

Routing over a packet switching network 170 is normally relatively 
simple. When node 130-1 receives packets from link 115 or link 125, the node 130-1 
determines how to route the packets based on the destinations, which are contained in 
the packets, and routing criteria, as discussed above. However, the routing criterion is 
usually solely shortest path, and most routers are oblivious to other criteria such as 
lowest cost and congestion. In the example of FIG. 1, it is assumed that the shortest 
path is through link 135-1, and packets from both source devices 110 and 120 are 
routed through this path. The node 130-2 will make a similar routing determination, 
again choosing the shortest path, which is through link 135-2. Consequently, routing 
tends to be a relatively simple affair, although more complex routing routines may be 
used, as discussed below. 
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Another important routing problem is providing backup paths in case a 
path through which data is currently being routed becomes unusable. For instance, 
packets from both source devices 110 and 120 might be routed through the path 
containing links 135. This path is called the primary path, and this path must, in this 
example, have enough capacity to support both source devices 110 and 120. For 
example, if source device 110 needs a capacity of 5 units and source device 120 needs 
a capacity of 7 units, then the path through links 135 must support at least a capacity 
of 12 units. The backup paths in this example are the paths through links 145 and 
links 155. These backup paths are called secondary paths, and capacity on these paths 
is reserved in case a failure occurs on the links 135 or nodes 130. In many systems, 
the capacity reserved on the secondary paths is at least the capacity on the primary 
paths. Therefore, in this example, 12 units of capacity must be reserved on the 
reserve paths having links 145 and 155. Assume that the reserved capacity on the 
path having links 145 is 5 units and the reserved capacity on the path having links 155 
is 7 units. In this situation, the system 100 can respond to a failure of an link 135 or 
node 130-2 by rerouting the packets from source device 110 through the path having 
links 145 and by rerouting the packets from source device 120 through the path 
having links 155. A system 100 or network 170 that provides reserve capacity for 
data is said to support recovery, and a system 100 or network 170 that reserves 
secondary capacity equivalent to the primary capacity is said to support 1-to-l or 1+1 
protection. 

The latter type of protection is fairly simple to implement in a network 
170 using light pathways as the transmission medium. For instance, node 130-1 can 
simply sense if there is light on the link 135-1 and route the light pathways from link 
135-1 to link 145-1 (for source device 110) and link 155-1 (for source device 120) if 
there is no light on link 135-1. While this type of protection is relatively simple, it 
takes forethought to configure the network 170 to automatically transfer capacity 
during failure. Additionally, the network 170 is usually limited to the predetermined 
primary and secondary pathways, and other secondary pathways, for instance, cannot 
be chosen without manual intervention. Finally, 1+1 protection reserves too much 
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capacity and will cause a network to have to add capacity sooner than what would be 
necessary if less capacity is reserved. 

One way for less capacity to be reserved is to allow secondary paths to 
be shared. For instance, assume the links 135 only support 5 units of capacity, while 
links 155 only support 7 units of capacity. Source device 1 10, which needs 5 units of 
capacity, is routed by node 130-1 through link 135-1, and source device 120, which 
needs 7 units of capacity, is routed by node 130-1 through link 155-1. If it is assumed 
that only one failure of a path will occur at a time, such that either the path having 
links 135 or the path having links 155 will fail but not both, then the reserved capacity 
can be 7 units of capacity. This allows the reserved path having links 145 to have 7 
units of capacity reserved on the path, instead of the 12 units needed if both the path 
having links 135 and the path having links 155 fail at the same time. This recovery 
system is a shared recovery system. 

A problem with shared recovery systems is that the analysis of a 
network can be quite complex. There are mathematical solutions that will determine 
an optimum routing for both primary and secondary paths for a number of sources and 
destinations. However, these mathematical solutions tend to take quite a while to 
solve and they tend to fail when the number of nodes, sources, and destinations 
exceeds a certain amount. 

Moreover, as discussed above, many networks remain relatively fixed 
in their ability to have nodes switch paths. Nonetheless, new routers are becoming 
more prevalent, and the new routers support certain protocols that allow a remote 
device, such as network controller 180, to more easily configure routers to 
automatically switch to secondary paths when primary paths fail. Two important 
protocols that may be used to perform configuration functions are Multi-Protocol 
Label Switching (MPLS) and Multi-Protocol Wavelength Switching (MPAS). The 
latter is basically a specialized version of the former. 

A network that supports MPLS is one type of label switching network. 
A label switching network runs in conjunction with a packet switching network, and 
uses labels placed into packets in order to route packets. The exact technique used to 
place the label into a packet depends on the underlying packet switching protocol 
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being used. For example, the location for a label placed into a packet for a network 
using the Internet Protocol (IP) may be somewhat different than the location for a 
label for a packet for a network using Asynchronous Transfer Mode (ATM). 

Regardless of the actual technique used to support labels in packets, 
normal routing procedures for packet switching are disabled in a label switching 
network. A "tunnel" is created to connect a source with a destination. The 
appropriate labels are selected for packets emanating from the source and entering the 
network so that the packets follow the tunnel and exit the network at the destination. 

Even with these new protocols, the problem of determining how to 
efficiently and quickly route all of the requested capacity in a network while still 
providing shared recovery for large networks is still quite important. A need therefore 
exists for techniques that solve this problem. 

Aspects of the present invention solve this problem by providing 
techniques that quickly converge to a network routing solution that is within a 
predetermined error from an optimal network routing solution. 

The present invention is generally applicable to any technology or 
protocol that is path-oriented (as opposed to being link oriented) and that can perform 
explicit recovery, which means that the technology can transfer flow from one path to 
another. For example, Multi-Protocol Label Switching (MPLS) and Asynchronous 
Transfer Mode (ATM) are suitable technologies for implementing aspects of the 
present invention. The present invention is less applicable to Internet Protocol (DP), 
because IP is link (and not path) based. It should be noted, however, that MPLS and 
Multi-Protocol Wavelength Switching (MP^S) can be implemented over IP. 

It should also be noted that the techniques of the present invention are 
scalable. Many techniques for determining network routing fail when the number of 
nodes passes a certain value. It can be shown that the present invention can be used 
with a very large number of nodes. 

Broadly, the setting of shared protection is as follows. A network 
G = (V,E) is considered with a capacity u(e) and cost c(e)for each link e. The 
network G is modeled as an undirected graph. In graph theory terms, G is usually 
called a graph and a link e is called an edge. There is also a set, K , of commodities 
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(or "demand-pairs"), each commodity k e K specified by a source-sink pair (s k ,t k ), 
demand d k , and a collection of pairwise link-disjoint paths, each of which 
connects s k to t k . That is, no two paths in have any common link. Any single 
link of G may fail at any time, thus rendering all paths that pass through it temporarily 
useless, until the link is made live again. All flows, however, for commodity k must 
be routed on one of the paths in A^ . 

One objective of the present invention is to design working and 
restoration capacity so that under certain failure states, there is still enough capacity to 
meet demand. Moreover, the long term operational cost of these traffic flows is to be 
minimized. To do this, statistical information on the relative likelihoods of the 
various links is assumed, and as a result, paths in A := \J keK A k failing (a path fails 
when one of its links fails). This information is updated as further failures occur, with 
old information discounted in order to have an accurate picture of the existing 
network. For each commodity k specified by its source-sink pair, ( s k , t k ), what must 
be chosen is (a) appropriate primary flows on paths in A k , and (b) how much flow is 
transferred from one path to another in the event of a failure. This important 
requirement that there be sufficient flow even under any link failure is formalized 
below. Furthermore, under the normal state of no failure, as well as under the failure 
of any single link, the capacity of any link, e , should not be exceeded: the total 
demand using it should be at most u(e). An objective is to design the paths, flows, 
and flow transfer strategies under failure, in order to minimize the long term average 
cost of operating the network. This average cost is just the expected long term cost of 
operating the network under a given collection of primary and backup paths, where 
the random variables are the failures of the various paths. 

Referring now to FIG. 2, an exemplary system 200 is shown. System 
200 comprises k sources 210-1 through 210-k (collectively, "sources 210"), k 
destinations 240-1 through 240-k (collectively, "destinations 210"), network 270, and 
a network controller 260. Network 270 comprises a plurality of nodes, of which 
nodes 220-1 through 220-4 (collectively, "nodes 220"), nodes 230-1 through 230-3 
(collectively, "nodes 230"), and nodes 250-1 through 250-3 (collectively, "nodes 
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250") are shown. Although only nodes 220, 230, 250 are shown, generally a network 
270 using techniques of the present invention will contain many such nodes. The 
nodes that are shown are merely exemplary. The nodes are interconnected through 
links (also called edges). For instance, node 220-1 is connected to node 220-2 
through link 221-1. Each of the links 221-1 through 221-3 (collectively, "links 221"), 
link 232-1, link 232-2, links 251-1 and 251-2 (collectively, "links 251"), links 2311 
and 231-2 (collectively, "links 231") have a capacity u(e) and cost c{e) . 

Each commodity k, which is specified by a source-sink pair (s k9 t k ) 
such as source-sink pair (210-1, 240-1), has a demand d k . In the example of FIG. 2, 
the source-sink pair (210-1, 240-1) is routed over a path comprising nodes 220 and 
links 221. The source-sink pair (210-1, 240-1) has demand d x . Similarly, source- 
sink pair (210-k, 240-k) is routed over a path comprising nodes 230 and links 231. 
Capacity is reserved on links 222-1, 250, and 222-2 in order to route the demand d x if 
one of the links 221 fails (or, equivalently, if a node 220-2 or 220-3 fails). Similarly, 
capacity is reserved on links 232-1, 250, and 232-2 in order to route the demand d k if 
one of the links 231 fails (or, equivalently, if a node 220-2 or 220-3 fails). This 
example is a shared recovery network, although the techniques of the present 
invention may also be used to determine network routing for dedicated reserve 
capacity, as discussed below. 

In order to route the demand, network controller 260 uses a number of 
methods to determine how much demand should be routed on each link (called the 
primary capacity) and how much capacity should be reserved on each link in case a 
link on which demand is being routed fails (called secondary or reserve capacity). 
The network controller 260 then acts to configure nodes 220, 230, and 250 of the 
network 270 in order to bring about the determined primary and secondary capacities. 

In a preferred embodiment, all or a portion of network 270 employs 
label switching. This label switching is generally controlled through Multi-Protocol 
Label Switching (MPLS), which can include Multi-Protocol Wavelength Switching 
(MPAS), and its control protocols. A good introduction to MPLS and MP/IS is given 
in Jarram and Farrel, "MPLS in Optical Networks: An Analysis of the Features of 
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MPLS and Generalized MPLS and Their Application to Optical Networks, with 
Reference to the Link Management Protocol and Optical UNI/' white paper from 
Data Connection, 100 Church St., Enfield, UK, http://www.dataconnection.com (Oct. 
2001), the disclosure of which is hereby incorporated by reference. 

As discussed briefly above, MPLS is a new and promising routing 
scheme that aims to provide end-to-end quality of service (QoS) in otherwise 
best-effort packet networks. The attraction of MPLS is due to its combined high 
performance and efficient layer-2 switching with the intelligence of layer-3 Internet 
protocol (IP) control. By creating and pinning down tunnels, or Label-Switched 
(explicit) Paths (LSPs) with well-defined bandwidths, MPLS makes it possible to 
carry QoS-sensitive applications such as real-time audio and video in an IP network. 
See Rosen et al., "Multiprotocol Label Switching Architecture, Internet Engineering 
Task Force Request for Comments 3031 (IETF RFC3031) (Jan. 2001), the disclosure 
of which is hereby incorporated by reference. By identifying a wavelength with a 
label, this protocol has been extended to provide a complete IP-based mechanism to 
set up, manage, reconfigure and tear down wavelength tunnels in the optical core of 
networks. MP/IS, as this protocol is known, makes it possible to add intelligence and 
reconfigurability to the traditionally static optical core of networks. See Ashwood- 
Smith et al., "Generalized MPLS - Signaling Functional Description," IETF draft 
(Nov. 2001) and IEEE Communications Magazine (Dec. 1999), the disclosures of 
which are hereby incorporated by reference. Intelligent routing and reconfigurability 
are the key requirements of the emergent optical Internet. 

To provide these desired capabilities in an optical network, traffic 
loads between distinct source-destination pairs are measured and abstracted, and the 
emerging set of demands aggregated, groomed and source-destination loads with 
bit-rates at or close to a wavelength are created. These are the set of loads that need 
to be provisioned over an existing network, with both primary and backup 
wavelength-switched paths (also known as A-Switched Paths or ASP tunnels) for 
protection. Since only a small number of backup tunnels are typically invoked at a 
time, there is considerable room for their capacity sharing. For instance, if a network 
is to protect against single-link failures and if primary tunnels P x and P 2 do not share 
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any links, then their respective backup paths can share As (since P 1 and P 2 should not 
fail simultaneously). Efficient management of backup paths such as MP^S tunnels in 
a network, can result in up to fifty percent increase in the throughput of a network. 
For the latter, see Davis et al., "SPIDER: A Simple and Flexible Tool for Design and 
Provisioning of Protected Light-Paths in Optical Networks," Bell Labs Technical 
Journal, Vol. 6, No. 1 (Jan. 2001), the disclosure of which is hereby incorporated by 
reference. 

Referring now to FIG. 3, an exemplary network controller 160 is 
shown interacting with network 270 and Digital Versatile Disk (DVD) 370. Network 
controller 160 comprises a processor 310, a memory 320, a media interface 330, and a 
network interface 340. Memory 320 comprises a network routing determination 
process 345, routing information 350, and a network configuration process 360. 
Network interface 340 communicates with network 270 and with processor 310 and 
memory 320. Note that many networks will contain other layers of software and 
hardware that are not shown but are known to those skilled in the art. For example, 
the Internet Protocol (IP) contains multiple layers, some of which are used to access 
hardware and others are strictly software. Media interface 330 is used to allow 
processor 310 to access DVD 370 or other types of media, such as hard drives or 
removable drives or optical storage means. 

Network routing determination process 345 uses demands for 
commodities and determines appropriate routing for links. Network routing 
determination process 345 uses an s -approximation technique, which means that the 
optimal network routing is not determined. Instead, an approximation to the optimal 
network routing is used, and this approximation is within s away from the actual 
network routing. By making s very small, the techniques of the present invention 
may be used to come arbitrarily close to the actual network routing such that, in real- 
world terms, the e -based and actual network routings are indistinguishable. 
However, with decreases in e , there are corresponding increases in processing time. 
Technologies such as the Internet and IP can experience rapid changes in demand, and 
it is beneficial to use higher values of e in order for the methods of the present 
invention to quickly converge to a solution. In addition, one may compute, ahead of 
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time, an upper bound on the number of iterations require to obtain the desired 
accuracy. This gives an estimate of how long to expect the algorithm to run. 
Moreover, current Internet routing techniques are poor in this regard, and are often 
30-40 percent away from the optimum network routing for a particular set of 
commodities and demands. 

The routing information determined from network routing 
determination process 345 is stored in routing information 350, and this information 
will usually comprise a list of links, how much demand should be routed over each 
link, and how much demand should be reserved on each link. The reserved capacity 
on each link can be shared capacity, dedicated capacity, or a combination of both of 
these. Routing information 350 may also contain additional data, such as cost for 
each link. 

Network configuration process 360 uses the routing information 350 to 
configure network 270 to support the primary and secondary flows of the routing 
information 350. As is known in the art, routers generally contain routing tables that 
are used to route incoming packets to outgoing ports. These tables can be modified to 
set up the network 270 in accordance with the routing information 350. Similar tables 
are used for MPLS and MP AS, and these tables may be changed through certain 
control protocols. See, for instance, the Link Management Protocol section of Jerram, 
which has been incorporated by reference above. Additionally, the routing 
information 350 may be simply output to a screen or file in a manner suitable for a 
network administrator to view and determine network routing. 

As described in detail below, network routing determination process 
345 can contain budgeting criteria or have no budgeting criteria. Network routing 
determination process 345 can employ shared recovery, where no reserve capacity is 
dedicated, or employ dedicated reserve capacity for each commodity. In FIG. 4, a 
method is disclosed for determining network routing without a budget constraint, and, 
in FIG. 5, a method is disclosed for determining network routing with a budget 
constraint. Other methods are discussed below. 

As is known in the art, the methods and apparatus discussed herein 
may be distributed as an article of manufacture that itself comprises a computer 
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system-readable medium having computer system-readable code means embodied 
thereon. The computer system-readable code means is operable, in conjunction with a 
device such as network controller 160, to carry out all or some of the steps to perform 
the methods or create the apparatuses discussed herein. The computer system- 
readable medium may be a recordable medium (e.g., floppy disks, hard drives, 
memory cards, or optical disks, such as DVD 370, which is accessed through medium 
interface 330) or may be a transmission medium (e.g., a network comprising fiber- 
optics, the world-wide web, cables, or a wireless channel using time-division multiple 
access, code-division multiple access, or other radio-frequency channel). Any 
medium known or developed that can store information suitable for use with a 
computer system may be used. The computer system-readable code means is any 
mechanism for allowing a computer system to read instructions and data, such as 
magnetic variations on a magnetic medium or height variations on the surface of a 
compact disk, such as DVD 370. 

Memory 320 configure its processor 310 to implement the methods, 
steps, and functions disclosed herein. Memory 320 memories could be distributed or 
local and the processor 310 could be distributed or singular. Each memory could be 
implemented as an electrical, magnetic or optical memory, or any combination of 
these or other types of storage devices. Moreover, the term "memory" should be 
construed broadly enough to encompass any information able to be read from or 
written to an address in the addressable space accessed by a processor 310. With this 
definition, information on a network (e.g., wired network 270 or a wireless network) 
is still within a memory, such as memory 320, because the processor, such as 
processor 310, can retrieve the information from the network. It should be noted that 
each distributed processor that makes up a distributed processor generally contains its 
own addressable memory space. It should also be noted that some or all of network 
controller 160 can be incorporated into an application-specific or general-use 
integrated circuit. For example, the network routing determination process 345 may 
be made into an application-specific integrated circuit. 

The problem of determining flow in a graph, such as system 200 of 
FIG. 2, is now modeled more formally. A graph, G = (V,E), is considered with edge 
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capacities u(e) , costs c(e) ; a set of commodities K, each commodity k specified by 
a source-sink pair (s k ,t k ) with demand d k ; and a set of network states Q. Each state 
corresponds to a potential set of network element failures. It is assumed that there is a 
state, q 0 eQ, that is the normal state when no failures have occurred. With each 
commodity, k , is a corresponding set of disjoint paths A k from s k to t k . By 
"disjoint," it is meant that no two paths in A k have any common link. What is sought 
is enough reserved capacity to route all demands d k on paths in A k , while respecting 
edge capacities so that, if some network failure occurs affecting a subset of paths in 
A : = U ie ^ A k , there is enough reserve capacity to reroute any disrupted flow. The 
cost of a solution is the sum over all links e e E of their costs, c(e) , times the 
expected amount of flow on that edge e (over time). 

First, the best fractional solution possible is determined. To do this, 
the problem is formatted as a linear program. Variable x(P) denotes the amount of 
flow on pathP. Variable y p '(P) denotes the amount of flow that is rerouted from 
path P' to path P when a link on path P' fails. Variable y p '(P) is defined if and 
only if the following are true: (a) P * P' and (b) Pand P'are paths for the same 
commodity. When failure q occurs, the set of affected paths is denoted by A(q) (so 
A (tfo) = 0)- Let Q k c Q denote the set of all failures that affect some path in A k . 
The formulation used herein assumes that |A 4 r\ A(q)\< 1 for all k,q . That is, it is 
assumed that a failure does not affect more than one path in A k . 

For each path P e A , let k(p) denote the steady-state proportion of 
time for which P is in non-failed mode. As mentioned above, these probabilities 
*(P) are contmuously updated and learned. To model the objective function, assign, 
for each commodity k and paths P,,P 2 eA 4 , a cost c(P lt P a ) as follows: 
c(P,P)=K(p)£ eeP c( e ),andif P,*P 2 , c(P 1 ,P 2 )=(l- K (p 1 ))Z fleP2 c{e). Thus, c(P,P) 
is the (long term) expected cost of primary flow on path P , and c(P„ P 2 ), for P, * P 2 , 
is the (long term) expected cost of backup flow on P 2 from P x . Henceforth, set 
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c(p) := c(P, P) . So, by the linearity of expectation, the objective function of long term 
average cost is: 



k PeA* 



c(p>(p)+ Ecfopy^p) 



(1) 



The problem, called Minimize Operational Cost (MOC), may now be 
formulated as the following Linear Program (LP): 

mm EC(x, y) 

subject to the following: 



2>( p ) >d k VkeK 



PeA, 



2 WP) + /'(P)] >d k VF'eA^VkeK 

PeA A \P' 



I s 

*ei: P:eeP 



*(P) + J] 



<w(e) VeeE,VqeQ 



x,y >0 



The first constraint above says that under no-failure conditions, the 
total demand d k should be met for each commodity k . The second constraint says 
that this demand-satisfaction should hold even if any path P' e A k fails. The third 
constraint is that under any network state q (including the no-failure state q 0 ) 9 the 
total flow on any link e should be at most its capacity. The final trivial constraints 
are also shown, which force all variables to be non-negative. 

This linear program (MOC) will be very large for moderate sized 
networks and a moderate number of failure scenarios. This linear program may be 
used to model how to route traffic in the Internet. Since the Internet changes 
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frequently, this problem needs to be solved quickly. Exact linear programming codes, 
which will determine an exact solution to the problem, are too slow. Since the 
networks normally dealt with typically have reserve capacity, approximate solutions 
are suitable. Aspects of the present invention adapt approximate LP techniques to 
obtain a provably good solution to this linear program. For multicommodity flow 
problems, s -approximation methods have proven to be computationally effective in 
practice. Thus, these methods can be adapted to solve this problem. 

1. Maximum Concurrent Recovery Problem 

To start, it is determined if there is a feasible solution. Thus, the cost 
objective function is exchanged with the objective to maximize the minimum fraction 
of each demand that is met. This involves modifying the demand constraints as well. 
A new variable X is introduced that represents the minimum fraction of demand 
routed, over all commodities, and the variables f(e) are replaced with the given 
capacities u(e) . 

Since it is assumed that \A k n A(q)\ < 1 for all k,q, the second set of 

inequalities in (MAC) can be simplified so that there is precisely one inequality for 
every P'gA. For clarity, this will be written as V P' e A k , V k e K . The summation 

in the third set of inequalities in (MAC) is also broken down by commodities. This 
linear program is then as follows (note that the maximization of X formulation is 
copied from the (MOC) above). 

max A 

subject to the following: 

£x(P) >ld k \fk^K 
£[x(P)+ /'(P)] > Xd k VP'e A k ,Vk e K 

PeA 4 \P* 
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keK P:e«=F 

PeAjAAl 

7, X > 0 



*( p )+ IX(p) 

FeA k nA{q) 



<u(e)VeeE,VqeQ 



The dual of this linear program is 

min5>(e)£V<>) 
X 2>*(e) * ^ V * e VP' e A t , P e A t - {P'} 

q:J?'eA(q) eeP 

?:P«A(?) eeP P'eA t -{P} 
£eA Arsis: P'eA k 

h, w 9 z > 0. 

It is beneficial to define new dual variables, Z, = z. + Y w p ' 
Intuitively, the dual variable represents the marginal cost of link e when the 

network is in state q . Following this intuition, w r is the cost of the cheapest 
recovery path that could be taken if path P' were to fail. This means Z k is the cost of 
the cheapest backup path for commodity k 9 including in this the cost of the 
appropriate backup path in case of failure. The dual linear program may then be 
rewritten as follows: 

min£ii(e)£A*(e) 

ee£ q*Q 

Y 2> ? 0) ^w p ' V*eA:,VP'*P',P'eA t ,PeA t 

q:P'eA(q) eeP 

q:¥<£A{q) eeP 
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Z d * z * * 1 



keK 



x,y 9 A>Q 

1.1 An Exemplary Method for Determining an Approximate Solution 

A preferred method for determining an approximate solution to 
network routing will be discussed in the following manner. First, the method will be 
described in succinct terms and through a figure. Next, a more elaborate pseudocode 
version of the method will be presented. Finally, analyses and proofs will be 
presented. 

Referring now to FIG. 4, a method 400 is shown for determining an 
approximate solution to network routing. Method 400 is performed by a network 
controller, although other computer systems may also perform the method. First, the 
method 400 starts with by setting a dual solution h q (e) = S/u(e) (step 410), for an 
appropriately small 8 , which will be given later as a function of s , m , and | Q | , and 
a primal solution x = y = 0 (step 415). Given any vector A, one can obtain w by 
setting w ? ' equal to the minimum over all possible backup paths P for P r of 
5^c,)21p**<*)- Define as * g (P)-Z-* f W. Then define Z k as 

follows. 



Z. = min 



mm 



(2) 



Note that what has been ensured is that all the dual equations, but that for X , are 
satisfied. To satisfy the X equation, simply divide all dual variables by 
a + ^Y^feK ^k^k to obtain a feasible solution that satisfies J] d k Z k = 1 . 

This exemplary method cycles through the commodities. For each 
visit to commodity k (step 420), the method will push d k flow along paths in . 

This pushing is preferably performed on a "cheapest" pair of paths P 1? P 2 , which is a 
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pair for which the primary flow on P 1 plus the restoration flow on P 2 is minimized 
(step 425). The method then pushes along P 1 an amount u determined as the 
minimum of three quantities. Let u(P) denote the bottleneck capacity of 
P : w(P) := min eeP u(e) . Then u is set equal min{w(P 1 ),w(P 2 ) > c/ jfc } . The method 
continues by pushing u along path P 2 and updates the primal and dual variables 
x, y, h as described in the pseudocode below. In particular, costs are updated (step 
430) for each failure mode. The method 400 repeats this process until d k units of 

flow have been pushed along primary paths, then proceeds to the next commodity 
(step 440 = YES). In other words, if the amount being pushed through a primary path 
is less than the demand needed by the commodity (step 435 = YES), then demand is 
again routed (step 420) and costs are updated (step 430). Otherwise (step 435 = NO), 
another commodity is selected (step 440). In practice, it is beneficial to start the 
method by scaling all capacities so that max^ d k < min e u(e) . Thus, u is in effect 

always equal to d k , and the method sends flow exactly once per commodity per 
iteration. 

The method stops when an e -approximate solution is found. An 
e -approximate dual solution is be used to verify the e -approximate primal solution. 
It will be shown below that a sufficient criterion for this is when the value of the dual 
objective function is at least 1. The dual objective function for this example is 
defined above as ^u(e)£h q (e) , which is being minimized in the linear program. 

The method ends when the value of the dual objective function is at least one (step 
445 = NO), else another commodity is selected (step 445 = YES). Given h q for each 
q , define D(h) as the value of the corresponding dual objective function. For an 
integer i, define D(z) to be the dual objective function value at the beginning of the 

/* iteration; whenever the term "iteration" is used below, the outer "while" loop in 
the method is being described. Method 400 in pseudocode appears below. 



Initialize h q (e) = S/u(e) VeeE,Vq<=Q 
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Initialize x = 0,y = 0 
while D(h)<\ 

for k = \to \K\ do 



while D(h) < 1 and d' > 0 

P 15 P 2 <- shortest disjoint path pair in A^that 

minimizes Eq. (2) 
w <-min{c/',«(P 1 ), w (P 2 )} 

^(P^jcCP^ + w 

/'(P 2 )^/'(P 2 ) + " 

d'^d'-u 

for 9 e g 

if Pj g A(g) do 

SU 

VeeP l9 h q (e)<r-h q (e)e <e \ 

else do 

eu 

Vee¥ 2 ,h q (e)<^h q (e)e u(e) . 

end for 
end while 

end for 
end while 



The procedure stops at the first iteration t for which D(t + 1) > 1 . In the first t - 1 
iterations, for every commodity k , there has been routed (t - V)d k units of flow along 
primary paths and backup paths. This flow may violate capacity constraints. By 
scaling appropriately, the flow can be made feasible. 

A lemma and a theorem are now discussed. 
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Lemma 1.1: k> . 

Proof: Consider link e . Consider the third primal inequality for some 
fixed (q,e). Whenever *(P) is increased for some (£,P) pair such that 
Pe A * \A(£) by some value u, the first "for" loop inside the inner while loop 
multiplies h q (e) by e m!uie \ Whenever >> P (P) is increased for some (£,P,P), such 
that P e A k \ A(q) and Pg A(q) n by some value u , the second "for" loop inside 
the inner while loop multiplies h q (e) by . Thus, h q (e) increases by a factor of 

e s with every u(e) units that use link e in state q . Since h q (e) = S/u(e) initially 
and is less than llu(e) at the end of iteration t-l , the flow through e is at most 
u(e) • log eS 1 / d at the end of iteration t-l. Thus, dividing the primal solution at the 
end of iteration t-l by log e£ lid gives a feasible solution. This ends the proof of 
Lemma 1 . 

It should be noted that other exponential functions, instead of e*" /a(e) ? 
may be used to modify h q (e) . However, it is likely that the function used to modify 
h q (e) must be exponential in order for the method to provably converge to a solution. 

Theorem 1.2: Suppose the optimal primal (and hence dual) solution 
value is at least 1. If this is not true, it is easily achievable by scaling the numerical 
values in the primal and dual problems, as pointed out in Garg et al., "Faster and 
Simpler Algorithms for Multicommodity Flow and Other Fractional Packing 
Problems," IEEE Symp. on Foundation of Comp. ScL, 300-309 (1998), the disclosure 
of which is incorporated herein by reference. The primal solution at the end of the 
(t-l)st iteration divided by log ee 118 is an e' -approximate solution, for £ f =ce, for 
an appropriate constant c> 0 , and appropriate choice of S . 

Proof: Let h t be the length functions at the beginning of the i th 
iteration. Let h i k be the length functions before routing the k th commodity in the i* 
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iteration. Let h i k s be the length functions before routing the s th pair of paths for 
commodity k in the /* iteration. 

Let Z k (h) be the value of Z k computed using the given values of h 
variables before the rescaling necessary to make the dual feasible. Equivalently, 
Z fc (h) is the value of Eq. (2), where P 1? P 2 are the shortest disjoint-path pair which the 
method would select when presented with the length functions given. Let a(h) equal 
YjkeK d k Z k( h ) • The variable a is defined so that h divided by a{h) is a feasible 
dual solution with value D(h)la(h) . 

After flow u has been routed on P } and P 2 as the s th pair of primary 
and backup paths for commodity k, the bound shown below occurs for D(h iks+l ). 
The bound is presented below using the following notational conveniences: 

Smfaqy.eeP^tAfa)} (3) 
S' = {(e 9 q):eeP 29 P 2 eA(q)} (4) 



Using this notation, the bound is then the following: 



e *Q q^Q 



= D(h^)+ 2i«(e)^(e)( e e »« -1) + 



Z ^)K + * 2J ^)> (5) 
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where inequality (5) uses the fact that e a <l + a + a 2 for 0<a<l. For notational 
convenience, let 

eePj ^ «A (9) eeP 2 ^Pj eA(? ) 

Thus, from Eq. (5), the following can be discerned: 

2)(^)^)+«(l + 8h (6) 

Arrt ) * ^A, ) + "4 + ). (?) 

where inequality (6) uses the fact that u is upper-bounded by min{w(P 1 ) 5 w(P 2 )}, and 
inequality (7) follows from the definition of Z k in Eq. (2). 

Note that the length functions are non-decreasing. Then over all the 
path pairs chosen in the process of pushing d k flow for commodity k 9 one can 
conclude the following: 

D(h iMl ) < D(h, k ) + s(l + e)d k Z k (h iM1 ) 

Each iteration pushes d k for every commodity. Extending this 
inequality over all values of k and again noticing that the length functions (and thus 
Z k ) are nondecreasing, it is possible to conclude the following: 

D(i + 1) < D(i) + e(l + e)^d t Z k (h M ) < D(i) + e(l + e)a(h M ). 

Thus, a crucial inequality has been established here that may be used as in the analysis 
presented in Garg (incorporated by reference above), Section 5. Let J3 be the optimal 
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dual (and thus primal) objective function value. Since the initial objective function 
value is at most m(\ Q | +l)d , through a very similar analysis to that in Garg, one can 
conclude that if /3 > 1 and if the method terminates in iteration t then 

P < g(l + g) 

Thus, setting S = (m \Q |+1)/(1 -e)) _1/ * implies (using the same arguments as in 

Garg) that this gives a provably good approximate solution. This ends the proof of 
Theorem 1.1. 

Run time analysis: Let n denote the number of nodes in G (as 
mentioned above, m denotes the number of links in G . Also recall that \Q\ = m + 1 . 

Each augmentation requires o(m\Q\\A\ 2 (log«) 0(l) ) time to find a shortest path. 
Similar to the analysis in Garg, Section 7, the number of iterations before the stopping 
criterion is met is o(e 2 (\A\ + m \Q |Xlog«) 0(l) ). However, in applications of the 
present invention, typically d k <u(P), so that the shortest path calculation at each 
step is also a minimum cost flow computation. Since minimum cost flow 
computations require fewer iterations, this gives an improved bound on the run time 
of o(s 2 m | Q || A | 3 (log«)° (l) ). For support that minimum cost flow computations 
require fewer iterations, see, for example, Garg, Section 5, or Grigoriadis et al., "Fast 
Approximation Schemes for Convex Programs With Many Blocks and Coupling 
Constraints," Society of Industrial and Applied Mathematics (SIAM) J. on 
Optimization, 4:86-107 (1996), the disclosure of which is hereby incorporated by 
reference. 

2. Concurrent Recovery with Budget Constraint 

One motivation for studying the problem in the preceding section is to 
develop a framework for quickly obtaining feasible solutions to (MOC). In fact, 
network designers are often interested in integer solutions where every commodity is 
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assigned one regular path and one backup path. This latter problem is NP-hard 
although one can obtain near-optimal solutions by rounding a solution to the linear 
program relaxation, e.g., one of the a -approximate solutions. 

This section addresses the full version of the problem (MOC) where 
the method of the previous section is modified to reserve bandwidth on links (not 
exceeding their capacity) which supports a routing and recovery solution as given in 
the previous section. The techniques of the previous section are only marginally 
suitable to obtain e -approximate solutions to (MOC). Instead, in this section, an 
alternate approach is considered. Instead of searching for the minimum cost 
bandwidth, the view is taken in this section that bandwidth is paid for as the method 
progresses. In this setting, backup capacity is paid for only as often as failures dictate 
that this capacity be purchased. In this sense, a minimum cost multicommodity flow 
and recovery problem may be formulated as follows. 

Assume that, for each commodity k and paths P 15 P 2 e A k , there is 
given a cost c(P 1? P 2 ) whose interpretation is as follows. If V Y =P 2 , then this is the 
per-unit cost of sending flow on the path. Otherwise, this is the per-unit cost of 
sending backup flow for Pj on path P 2 . Note that these constants may be hard-wired 
in advance to account for known statistical information about network failures. For 
instance, if it is possible to determine a value &(P) for each path P which indicates 
the percentage of time P is in a non-failed state, then setting 
c(P 1? P 2 ) = (1 - £( p ))21 € p c ( e ^ represents th e tota l cost of backup flow on P 2 for Pj . 
Similarly, c(P, P) = £(P)X ee p c ^ represents the total cost of primary flow on path 
P . Henceforth, set c(P) := c(P,P) and this notation should not be confused with its 
prior meaning. This new objective function is then: 



minX(Z c ( P M p ) + Z *(Pi>P)y Pl (P))- (8) 
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Lemma 2.1: Let (x\y*,f*) be a solution that minimizes ^/(e)c(e), 

and let (3c, y, f) be a solution that minimizes Eq. (8). If there is a constant 1 < £, that 
c(P)<£c(P') for all pairs P,P r ei> and all choices of k<aK, then 
X/(e)c(e)<(l + Z)^ e f*(e)c(e). 

Proof: The following can be shown. 

I7(«Me) * I>(P)[*(P) + E^CP)] 

e P P r 

<2<p)^*(P)+Z/ F ( p )] 

p p' 

<a+a2>(py(P) 

p 

e 

Thus, optimizing one choice of objective function provides a solution for which the 
value of the second objective function is close to optimal. In practice, the solution 
obtained using the proxy objective function should be much closer than the worst case 
bound given by this analysis. Li particular, the first, third, and fourth inequalities are 
likely to be far from tight. For the third inequality, this is because £ may be large 
when c(P) is very small for some P , but in this case, the total contribution to Eq. (8) 
of the capacity reserved on P is also small. When c(P) is large for some commodity, 
then the other paths for that commodity should also be close in cost. In addition, 
since in the end the fractional solutions are rounded to obtain integer solutions, the 
gain in cost due to using the proxy objective function is small relative to the gain from 
rounding. 

The minimum cost version need not be solved directly, but the 
maximum concurrent flow LP may be solved with an additional budget constraint. 
With an e -approximate method for this problem, one can obtain an approximate 
minimum cost solution by using an efficient search method (such as binary search) to 
find the minimum cost budget. 
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The budgeted linear program (referred to as BCR, for Budgeted 
Concurrent Recovery) and the corresponding dual linear program (DBCR) are now 
presented. 

2>(P) >M k VkeK 

V<=P k 
PeA fc \P" 



z z 

keK P:eeP 



*(p)+ 2> p '(p) 



<u(e)VeeE,VqeQ 



c(p)x(p) + j>(i>'>py'(p) 

P'eA*\P 



<5 



x ? ^ ? A > 0. 



The dual (DCBR) of this linear program is the following: 



min (pB + ^ A 9 (e) 

c(P',P)p + X 2> ? (e) V£ ej fi:,VP',PeA t ,P*P' 

g:P r eA(g)e<=P 

c(P>+ X T, h "& + Z ^ VP eA„ VAre.fi: 

g:P*A(?)eeP P'eA A -{p} 



keK keK P'eA# 

q>,h,w,z > 0 



For ease of notation in the following, if / is a linear function defined on ground set 
£ andPc£ ? then /(P) will be used to denote . 



2,1 An Exemplary Method for Budgeted Concurrent Recovery 

The problem (BCR) is a mixed packing and covering linear program: it 
has form max X 9 Px < p, Cx > Ac, x > 0 for nonnegative matrices P and C and positive 
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vectors p and c. Start with x, y = 0, h q (e) = 5lu(e), and <p = d/B. Given a 
singleton- vector pair (<p,h), let ju p (<p,h) and F p , p be defined as follows: 

iUp (^A):=c(P>+ X ^( P )> 

g:P' 6 P(«r) 

When 0 and /i are clear from context, they are written as ju p and 
Fp ip . The variables z k and ff p are obtained for FeA A by finding an optimal 
solution to the following new linear program, where the constraint (10) is supposed to 
hold for all P, P'eA*, P*P': 



maxd k [Z k +X ^ ( 9 ) 

v/ <v p ,p (10) 

z k + £ w p ' <// p VPeA, (11) 

P' e A,-{P} 

z,w >0 (12) 



Let Z A be the optimal value of this problem; that is, Z k = z k + ^ PeA W v . By 

dividing the solution (<p,h,w,z) by ^ k d k Z k , a feasible solution is obtained to 

(DBCR). Thus for now, the main concern is with only dual variables h and $ . 

As with the previous method, the method for the budgeted concurrent 
recovery problem cycles through the commodities. For commodity k , using current 
vectors h l k _ x and tp l d _ x , an optimal solution to the following problem is found, and 

this solution is used to determine the update step in the current iteration (the second of 
the following three constraints is supposed to hold for all P'eA t ): 
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min 2>p*(P)+ J] v p , p j p '(P) 

Pe A t (P,P')eA 2 A 



subject to the following: 



Z*( p ) ^ d k (13) 

X *(P)+ x y(P) > j 4 

x, y > 0 

Note that the linear program described by Eqs. (9)-(12) is the dual of 
Eq. (13). The optimal solutions to both problems take one of two forms. The 
structure of the optimal solution to both Eq. (13) and Eqs. (9)-(12) is described in 
Lemma 2.2. To begin, two functions are introduced that will be useful to demonstrate 
this. The first is an extension of the Z k introduced in the previous section. 

Z k = min fx v + v p p (\4) 

P^P^A,:^ P *' P 2 

To obtain the second, assume the P. e A k are indexed by increasing 
ju P{ value, so that ju Pi <n Vi <--. 



z *=£fcfL/<*'<r-i) (is) 



Let r k be the value of r that determines Z k in this expression. 

Lemma 2.2: The optimal solution to Eq. (13) has value equal tod k 

minjz^Z* J . If the optimal solution has value d k Z k determined by V x and P 2 , then 
it satisfies x(P x ) = (P 2 ) = d k , with all other variables zero. Otherwise, it has value 
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d k Z k and there are r distinct paths in such that *(P.) = -^- for i = 1,...,/% all 

r-l 

other variables are zero. 

— d 
Proof: Z* < Z* : In this case, it is claimed that x(P f ) = — — for i < r k 

A 

for r k the value that determines Zk , and x(P t ) = 0 for i >r k is an optimal solution. 

A 

This is clearly feasible and has value d k Zk. Now consider the dual solution 

A 

wQ? i ) = Zk- Mi if*"^ r * andZ t =0. This clearly satisfies Eq. (11). ByLemma2.3, 
w(P ; ) > 0 for all i, satisfying Eq. (12). Since Z<Z k < juP t + v PijP , then Eq. (10) is 

A 

satisfied for all P,P'. Thus, the solution is feasible. Since its value is d k Zk, 

matching the value of the above solution to Eq. (13), both primal and dual solutions 
are optimal. 

Z k <Z k : If Z k = J Wp l +v Pi p 2 set x(P x ) = y*(P 2 ) = d k and all other 

primal variables equal to 0. This solution is clearly feasible and has value d k Zk. 
Consider the following dual solution. 

n<P, ) = maxjz, - ^ ,o} V i < \ A k | (16) 

and z k =Zk- X!.>v(P.) . Since Z k <Z k , Lemma 2.3 implies that Z k < ju p for all 
w(P) = 0, and hence this solution satisfies Eq. (11). By definition, w is nonnegative. 
Summing Eq. (16) over all i with w(P.)>0 yields ^w(P / ) = rZ>t -^/^ , which 

implies that z k = ^-£*KP/) == 2> Pf -(r-l^^JX -(r-l)Z A >0, where 

A 

the second inequality follows from the definition of Zk . Finally, Z* < ^ p + v P P for 
all i and y. Hence Eq. (10) is satisfied, and the solution is feasible. It has value 
d k Zk , matching the given solution to its dual Eq. (13), hence both are optimal. 
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A A 

Lemma 23: Zk > ^ for i < r k and Z* < // P for / > r k . 

Proof: Let Z*(/) "XLi^. * ^ or ease °^ nota ti° n > M% is written 

A 

as // z - for the remainder of the proof. The lemma follows by noting that Z*(r + 1) is a 

A 

convex combination of Zk (r) and . Thus, fa for j > 2 is included in definition 

A A A 

of Zk if and only if p. < Zk{i-\) . Since < jUj for z < 7 , if fi t < Zk{i - 1) , then 

A 

/i t <Z k (j) for all j>i. 

Lemma 2.2 implies that the optimal solution to Eq. (13) may be 
described as a set of paths, each carrying an equal amount of flow. Without loss of 

A 

generality, assume this set is {P 1V ..,P }, where r k >2. (If Z*<Z, this set is 
{Pj,P 2 } and J* =2.) The amount of flow sent along each path is u where 

d B 
u := min{ — — ,min /Sr t*(P/), } . This quantity u is set so that the left side 

of an inequality in (BCR) does not increase by more than the fixed value of the right 
side in any one step. In practice, 2] PeA c(P) is significantly smaller than B , so that, 

when combined with scaling of capacities, u is determined by d k . The updates to the 

dual variables for each case are described in the method given below. The stopping 
criterion is the same as in Garg, which is incorporated by reference above. 

Budget-LP (G = (A, E, c 9 u 9 d)) 



Initialize h q {e) = S/u(e) VeeE,VqeQ 
Initialize <p = d/ B 

Initialize x = 0,y^0 
while D{h) < 1 

for k = 1 to \K\ do 
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while land d'>0 

Z k =min{T k J k }. 

Let {P 1? P 2 ,...,P^ } be paths achieving the 
optimum Z k . 

c< 5_ 

«o<-minf-^-,min 

^ 1 

8Uq_ 

cp<r-(pe c 

d'<r-d'-(r k -l)u 0 

x(P,)<-x(P 1 ) + Mo 

for q^Q 

if Pj g A(# ) do 

V^P 1? ^( e )f-F(^ w(e) . 
else do 

g«o 

VeeP 2 ,A"(e)<-^(e> a(e) . 

end for 
else (Z k =Z k ), 
for z<r t , 
x(P.)<-*(P,.) + Mo 
for e e P, q such that P,. «s A(q\ 

/? 9 (e)<-^(e)e u(e) . 
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end for 

end for 
end while 

end for 
end while 

2.2 Analysis 

The extended arguments for X > — - — - — are similar to the previous 

log ee (l/<S) 

arguments. It is now shown that the primal solution divided by this choice of X after 
t iterations is indeed an e -approximate solution. 

Theorem 2.4: Suppose the optimal primal (and hence dual) solution 
value is at least 1. The primal solution at the end of the (t-l)st iteration divided by 

log^llS is an -approximate solution, for £ f =ce, for an appropriate constant 
c> 0 , and appropriate choice of 3 . 

Proof: Let u t and <j> i be the dual variables at the beginning of the i th 

iteration. Let h i k s and q> i k s be the dual variables before routing the k th commodity 
in the i th iteration. Let h ik s and <p i k s be the dual variables before routing the s th pair 
of paths for commodity k in the i* iteration. Let D(i,k,s) be the value of the dual 
objective function using dual variables h i k s and q> itkjS . 

Let Z k (h, (p) be the value of Z k computed using the given values of h 
and <p before the rescaling necessary to make the dual feasible. Equivalently, 

Z k {h,<p) = rmn{Zk(h,<p),Zk(h,(p)} . Let a(h,q>) equal Zs keK d k Z k (h,q>). The variable 
a has been designed so that (h, (p) divided by a(h, (p) is a feasible dual solution with 
value D(h 9 <p)/a(h,q>) . Let D(i,k,s) be the value of the dual solution with variables 
h i k s and g> i k s . Below, it is established that 



D{U k,s + l)< D{\ 9 K s) + ue(l + s)Z(h i>kMs ) 
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holds no matter how Z k is determined. This is done by analyzing each case 

separately. Once Eq. (17) is established, the remaining argument mirrors the 
argument in the proof of Theorem 1 .2, so this portion of the proof is omitted here. 

Suppose Z jk (* if fc >f ,ftj fcff ) = Zj:(*, ijfc>f ,^ (f ). Let 5 and S' be as in Eqs, 

(3) and (4). After u is routed on V x and P 2 as the s th pair of primary and backup 
paths for commodity k , the following may be determined: 



I 10 = D(i,k, S ) + B<p^(e™< c «» c ™' B -V) + 

^ Thus, letting 7 = c(P Y ) + c(P 2 ), the following may be determined: 

fit 15 D(Uk,s + \)<D(i,k,s) + euy<p iks +s 2 u 2 y 2 / B+ n . 

X /&>)(« e + £ V/ W 00) + + eV /«(*)) 

< Z)(i, *, j) + ue(l + e)<p. kJ + ue(l + e) £ ^ (e) (19) 
= fc, s) + we(l + e)j> Pi + ] 

- D(i^s) + ue(l^8)Z(h ik)S9 (p iJc>s ) y (20) 

where inequality (18) uses the fact that e a <l + a + a 2 for 0<a<l; inequality (19) 
uses the fact that u <min {w(P l ),w(P 2 ) ? } ; and inequality (20) follows from 

) C (^2 ) 

the determination of Z k as in (14). 



20 
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Next, suppose Z k (h hkJ ,<p iXs ) = Z k {h ik ^(p ijcs ) . For ease of notation, 
the following are defined: 



(21) 



7=1 



if/ ^eu 
rj = urj /(r - 1) . 



J>(p,) /(r-1), and 



(Note that the union in Eq. (21) is a union of r pairwise disjoint sets, since the paths 
Pj are pairwise link-disjoint.) After u/(r-l) is routed in the i th iteration in the s th 

A 

round for commodity k on the cheapest r k paths determined by Zt, the following 
may be developed: 



D(i,k,s+\) = + 



Z)(a,*) + ^ IAi («- /c -l)+ 2 u(e)h« kiS (eXe>"^ -1) 

(e, ? )eS" 

< £>(/, *, 5) + ^ {y/ + y/ 2 /B)) + 

< D(i,k,s) + r,(l + e) j>P y (^ A,m) 
= D(i, k 9 s) + + e)Z, (A a> , , <p^ ) 



(22) 

(23) 
(24) 



where inequality (22) uses the fact that e a <l + a + a 2 for 0<a<l; inequality (20) 
uses the fact that u <min{(r-l)w(P y .}c}; and inequality (24) follows from the 

A A 

determination of Z* as in (15), the fact that Z k =Z k , and the choice of r and P. . 
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3. Commodity-Dedicated Capacity Reservation 

In this section, a simplified version of (MOC) is considered that insists 
on dedicated reserve capacity for each commodity. In (MOC), capacity used by a 
backup path for a failure affecting commodity k could be used by a different 
commodity when a different failure occurs. This more sophisticated model allows for 
better utilization of network capacity, hi current network models, however, this is not 
done. Instead, each commodity has dedicated reserve capacity. That is, if any path 
fails, then there is enough spare capacity reserved for that commodity to route all the 
demand for that commodity. Since there is no transfer of flow from one path to 
another in case of failure, another commonly used objective function is considered 
here: letting f(e) be the total flow on link e, the total cost of the flow, i.e., 

c{e)f{e\ is minimized. This problem and its variant, where the paths are 

chosen on the fly, has been studied by Bienstock and Muratore in the context of 
capacity expansion with integrality constraints. See Bienstock and Muratore, "Strong 
Inequalities for Capacitated Survivable Network Design Problems," Math. 
Programming 89, 127-147 (2001), the disclosure of which is hereby incorporated by 
reference. 

An e - approximation scheme is described here to solve the 
corresponding linear program for this dedicated reservation problem by observing that 
this problem corresponds to the special case of the previous problem with all y 

A 

variables forced to zero. In this case, Z k is always determined by Zk . Likewise, the 
h q (e) variables are replaced by a single variable h(e\ and z k =0 so may also be 

omitted. This modification in linear program formulation is highlighted below. The 
method and analysis then follow from the previous section, so are omitted. The 
dedicated capacity concurrent recovery problem: 

min £c(e)/(e) 
£jc(P)>^ Vk € JsT,P' e A k (25) 

PeA t \{p'} 
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J>(P)</(e) VeeE 



PieeP 



f(e) < u(e) \/e<zE 
jc(P) > 0 

It is possible to remove the variables /(e) from this formulation to 
obtain an equivalent formulation: 

min ]Tc(P);c(P) 
]T x(P) >d k V£ e K, P' e P k A 

PeA t \{p) 

2>(P)<wO) VeeE 

P:e«=P 

x(P)>0 

This is solved by solving the concurrent flow version with a budget 
constraint and then searching for the optimal budget. The corresponding primal and 
dual LP's are 



max Ad k 



^x(P)<u(e) VeeE 

P:eeP 
PeA 

JC(P)>0 



The dual LP is as follows, where the first constraint is supposed to hold 
for all k e K and for all PeA fc : 
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min B<fi + ^u(e)h(e) 
c(P)<p + ^h(e)> 

/,w>0 

Substituting Z fc = ]T PeAi w p , then the dual can be rewritten as follows (once again, 
with the first constraint is supposed to hold for all k <= K and for all PeAJ: 

min B(p + ^u(e)h(e) 

eeE 

w* +c(?)(p + h(?)>Z k 
^d k Z k >l 

keK 

l,w>0 

As mentioned above, the algorithm and analysis now follow from the 
previous section. 

It is to be understood that the embodiments and variations shown and 
described herein are merely illustrative of the principles of this invention and that 
various modifications may be implemented by those skilled in the art without 
departing from the scope and spirit of the invention. 



